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Question 4 


Intent of Question 


The goals of this question are to assess a student’s ability to: (1) plot a least squares regression line; (2) examine a 
residual; and (3) discuss the effect of an additional observation on an estimated correlation coefficient and on the 


least squares estimate of the slope of a line. 


Solution 


Parts (a) and (b): 





.../.| new point 


Height of Woman 





When x= 67, y = 35.1 + 0.427(67) = 63.709 
and the residual = y- y = 61 — 63.709 = —2.709. 





Part (c): 


See the new point indicated in the plot above. The slope would remain about the same since the new point 
is consistent with the linear pattern in the original plot (i.e., close to the line). 


S,, ‘ series 
The correlation coefficient would increase. We know that b = r—. The added point will increase s, 
s 


x 


s y . . . 
more than it will increase 5, so — will be less than 1. If the slope is to stay the same, r must increase. 
Ss 


x 


OR 
This point fits the pattern well and has an x value that is far from x. 
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Question 4 (continued) 


Scoring 


This problem is scored in 4 sections. Section | consists of the graphical parts of (a) and (b) together. Section 2 
consists of the numerical parts of (b). Section 3 consists of the first part of (c). Section 4 consists of the second 
part of (c). 


Each section is scored as either essentially correct (E), partially correct (P), or incorrect (1). 


Section 1 (graphical parts of a and b) is essentially correct (E) if: 
1. the regression line is drawn correctly on the scatterplot; 
2. the point (67, 61) is circled and the vertical segment corresponding to the residual is drawn on the 


scatterplot. 
Section | is partially correct (P) if the response includes one of the above two elements. 


Section 2 (numerical part of b) is essentially correct (E) if the residual is correctly computed as —2.709; 

OR 

the response states that the residual was approximated using the graph, a reasonable value for the residual is 
given, and the sign of the residual is correct. 


Section 2 is partially correct (P) if the magnitude of the residual is correct but the sign is wrong. 


Section 3 (first part of (c)) is essentially correct (E) if it: 
1. states that the slope will remain about the same (or change slightly); 
2. provides an explanation based on the new point fitting the pattern in the original plot. 


Section 3 is partially correct (P) if it states that the slope will be about the same, but the explanation is missing or 
incorrect. 


NOTE: If the line is drawn incorrectly in part (a), and the answer to this part is consistent with the line drawn, 
section 3 is essentially correct (E). 


Section 4 (second part of (c)) is essentially correct (E) if it: 
1. states that the value of the correlation coefficient will increase; 
2. provides an explanation based on the relative changes in s, and s, 
OR 
based on the fact that the new point fits the pattern AND is far out in the x direction, 
OR 
because the linear pattern is stronger. 


Section 4 is partially correct (P) if it states that the value of the correlation coefficient will increase, but the 
explanation is missing or incorrect. 


NOTE: If the response just says that the correlation coefficient will increase because the point is close to the line, 
section 4 is partially correct. 
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Question 4 (continued) 
4 Complete Response 
All four sections essentially correct 
3 Substantial Response 


Three sections essentially correct and no sections partially correct 


OR 
Two sections essentially correct and two sections partially correct 
2 Developing Response 
Two sections essentially correct and no sections partially correct 
OR 
One section essentially correct and two sections partially correct 
OR 
Four parts partially correct 
1 Minimal Response 
One section essentially correct and no sections partially correct 
OR 


No sections essentially correct and two sections partially correct 


If a response is between two scores (for example, 212 points), use a holistic approach to determine whether 
to score up or down depending on the strength of the response and communication. 
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4Al 


4. Each of 25 adult women was asked to provide her own height (y), in inches, and the height (x), in inches, of her 
father. The scatterplot below displays the results. Only 22 of the 25 pairs are distinguishable because some of the 
(x,y) pairs were the same. The equation of the least squares regression line is ¥ = 35.1 + 0.427x. 


Height of Woman (inches) 





aren of Father ae 


(a) Draw the least squares regression line on the scatterplot above. 


des! part of this page is illegal. GO ON TO THE NEXT PAGE. 
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4A2 


(b) One father’s height was x = 67 inches and his daughter’s height was y = 61 inches. Circle the point on the 
scatterplot on the preceding page that represents this pair and draw the segment on the scatterplot that 
corresponds to the residual for it. Give a numerical value for the residual. 


ctSidual = y-4re) = (25,1+.429-C@?) )5 Gl- G3, 209 = — 2,704 ‘noher 


aH - Polat 
~~ (c) Suppose the point x = 84, y = 71 is added to the data set. Would the slope of the least squares regression 


line increase, decrease, or remain about the same? Explain. 


(Note: No calculations are necessary to answer this. question.) 


‘The stope. Would temeun Frc seme lccowse He residual nWereis wey 
‘ASIgMiBeant and the point Cby, HW) aWwrett Falig of the exiskrg 
liee, thes the (newovid not havete accomodate toy Mowing , 


Would the correlation increase, decrease, or remain about the same? Explain. 


(Note: No calculations are necessary to answer this question.) 


The cosreatmn wevld_inertase because the poink is inflwenval, 
Ir Moregscs the “scope of thy leastsquoes regression line toy 
providing & point rier tht high endef the domain Cor which Me 
Pridietion te Aceurate, vuaning thot this ting ON reasonably he 
applied Por al larger rOongs of volyes, This onarty 1a, pomraiple 
“strengthens” Tre eorrelaten ondmateas if cleserte L 


any part of this page Is ilegal. GO ON TO THE NEXT PAGE. 
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4Bl 


4. Each of 25 adult women was asked to provide her own height (y), in inches, and the height (x), in inches, of her 
father. The scatterplot below displays the results. Only 22 of the 25 pairs are distinguishable because some of the 
(x,y) pairs were the same. The equation of the least squares regression line is } = 35.1 + 0.427x. 
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(a) Draw the least squares regression line on the scatterplot above. — 


Paani | 
any part of this page is illegal. GO ON TO THE NEXT PAGE. 
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4B2 


(b) One father’s height was x = 67 inches and his daughter's height was y = 61 inches. Circle the point on the 
scatterplot on the preceding page that represents this pair and draw the segment on the scatterplot that 
corresponds to the residual for it. Give a numerical value for the residual. 


ui-[oone) = CED 


(c) Suppose the point x = 84, y = 71 is added to the data set. Would the slope of the least squares regression 
line increase, decrease, or remain about the same? Explain. 


(Note: No calculations are necessary to answer this question.) 
Tu slope of Fk jeasr Sgueres pegs Iino would 


yeyrein about th sane because the petnr Cea 5 a 
Close, the \east” squares Vegressvor Vike. 


Would the corfelation increase, decrease, or remain about the same? Explain. 


(Note: No calculations are necessary to answer this question.) 


Tl correlation would also vemain about tHe Sone 
because correlation ££ He Slone of the leas+ 


S5uares  vegvession Ime are cxacth 7 sane 
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Y = pers 


4. Each of 25 adult women was asked to provide her own height (y), in inches, and the height (x), in inches, of her 
father. The scatterplot below displays the results. Only 22 of the 25 pairs are distinguishable because some of the 
(x,y) pairs were the same. The equation of the least squares regression line is ¥ = 35.1 + 0.427x. 






Height of Woman (inches) 


Height of Father (inches) 


(a) Draw the least squares regression line on the scatterplot above. 


vs 
——— GO ON TO THE NEXT PAGE. 
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4C2 
Cer, t') 
(b) One father’s height was x = 67 inches and his daughter’s height was y = 61 inches. Circle the point on the 
scatterplot on the preceding page that represents this pair and draw the segment on the scatterplot that 
corresponds to the residual for it. Give a numerical value for the residual. 


residual = Actual - Yreckicteck 


rtsidual&~ \ 


Cou H) 

/ 

(c) Suppose the point x = 84, y = 71 is added to the data set. Would the slope of the least squares regression 
line increase, decrease, or remain about the same? Explain. 


(Note: No calculations are necessary to answer this question.) 
This point may be intluential, which would change 
We slope, however his poms seems do Follow the 


CFE ING Wwe ar patttn, ¢o XT otontt Wink 
WAL Slope will Charge 400 much, 


Would the correlation increase, decrease, or remain about the same? Explain. 
(Note: No calculations are necessary to answer this question.) 
Corre lation witl increase lecavse rr is 
non- vesichunt +O extveme values, Ypowever 
Xhis povri may rer be consi {reat an 
Oxtveme value to Some, 


SEL Ee 1- GO ON TO THE NEXT PAGE. 


©2007 The College Board. All rights reserved. 
Visit apcentral.collegeboard.com (for AP professionals) and www.collegeboard.com/apstudents (for students and parents). 


AP® STATISTICS 
2007 SCORING COMMENTARY (Form B) 


Question 4 


Sample: 4A 
Score: 4 


This is a complete response that accurately presents the least squares estimate of the regression line on the plot, 
circles the point corresponding to a woman of height 61 inches who has a father of height 67 inches, and inserts a 
vertical line segment corresponding to the residual for that point. The residual is correctly calculated in part (b). 
The response to part (c) correctly explains that the additional point would have little effect on the least squares 
estimate of the regression line because it is close to the existing regression line. The response to part (d) provides 
a reasonable support for the conclusion that the additional point would increase the estimated correlation. 


Sample: 4B 
Score: 3 


This is a substantial response that accurately presents the least squares estimate of the regression line on the plot, 
circles the point corresponding to a woman of height 61 inches who has a father of height 67 inches, and inserts a 
vertical line segment corresponding to the residual for that point. The residual is computed correctly in part (b). 
The response to part (c) explains that the additional point would have little effect on the least squares estimate of 
the regression line because it is close to the existing regression line. The response to part (d) incorrectly argues 
that the correlation would stay about the same because the additional point would have little effect on the slope of 
the regression line. 


Sample: 4C 
Score: 2 


This is a developing response that reflects less than complete understanding in several parts. The least squares 
estimate of the regression line is not presented accurately on the plot. The point corresponding to a woman of 
height 61 inches who has a father of height 67 inches is circled, and a vertical line segment is used to indicate the 
residual for that point. The magnitude of the residual reported in part (b) is consistent with the line displayed in 
part (a), but the numerical value should be negative. The response to part (c) begins with a general statement, but 
it eventually indicates that the additional point would have little effect on slope of the least squares estimate of the 
regression line because it is close to the existing regression line. The response to part (d) indicates that the 
correlation would increase, but the explanation is poor and somewhat contradictory. 
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